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1. INTRODUCTION 


Label Identification from Statistical Tabulation (LIST) Is a stmlautomated 
labeling technology developed during the Large Area Crop Inventory Experiment 
(LACIE) conducted at the National Aeronautics and Space Administration (NASA) 
Lyndon B. Johnson Space Center (JSC) In Houston, Texas. A complete descrip- 
tion of this procedure can be found In reference 1. Basically, this procedure 
transforms analyst responses to a set of questions about a picture element 
(pixel) and the raw spectral data (four 4-by-l vectors for each pixel) Into a 
set of features and then applies a linear discriminant to these features to 
obtain a label of small grains or nonsmall grains. To develop the features 
used in LIST, the analyst responses to the pixel-level questions were assigned 
numerical values In such a way that smaller, assigned values were more char- 
acteristic of small grains. The automated features, derived from the spectral 
data, were also designed to be correlated with the probability of the presence 
of small grains. In this transformed feature space, the theoretically Ideal 
discriminant boundary would be obtained with a discriminant vector, certain 
components of which were all nonnegative. However, in the LIST application, 
the discriminant vector did not have this desired property. 

A literature search yielded a suggestion that the arbitrary assignment of 
values to each response be replaced by an assignment based on the distribution 
of all observed responses. (See Levine et al., ref. 2.) The transformation, 
called a RIOIT transformation, will be discussed in section 2. The name RIOIT 
was chosen by Bross (ref. 3) for this transformation. The first three letters 
are an acronym for "relative to an identified distribution," while the last 
two letters were chosen to form an analogy with "Probits" and "Logits." In 
addition to the RIDIT transformation, a discriminant Is introduced in refer- 
ence 2 to be used in conjunction with the transformation. This discriminant 
and others will be discussed in section 3. In section 4, the results of 
several discriminant analyses are shown, and conclusion and recommendations 
are given in section 5. 
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2. THE RIO IT TRANSFORMATION 

The use of RIO IT's was Introduced by Bross (ref. 3) and has appeared In many 
applications In the field of epidemiology (refs. 4 to 6). In reference 2, 
Levine et al. propose the use of RIOIT's in analyzing questionnaires. The use 
of RIOIT's applies when the difference In responses to a questionnaire is 
related to two classes within the data. The responses to each question must 
be ordered monotonically so that an Increase in the response Is related to the 
probability of membership In one of the two classes. The RIOIT transformation 
then replaces the arbitrary assignment of a value to a given response with an 
assignment of values based on the distribution of all responses to the ques- 
tion. To formulate the transformation, let X denote a response to the 
question of the questionnaire. Let P|^ (Z < X) denote the probability that a 
response to question K .s less than X and likewise let P|^ (Z > X) denote the 
probability that a response to question K is greater than X. The transformed 
value of the response X to question K is then given by the formula 

T|^(X) « P,j(Z < X) - P|((Z > X) 

■ fk(*) - “ M Fk(* - 

where F|^ is the cumulative distribution for the responses to question K. In 
practice, empirical estimates of these distributions are used to obtain the 
RIDIT transformation. 

Data which are transformed by this method have the following properties. 

a. The expected value of the transformed responses to each question, 

ECfk(X)], Is zero, and hence. If all responses to a given question are 
Identical, the transformed value Is zero and cannot contribute to a dis- 
criminant score. Another consequence of this property is the fact that 
zero lies between the means of the two transformed classes. To prove this 
assertion for the discrete case, suppose X takes on the n discrete values 
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Zj < Z 2 < ••• < Z„ with frequency Wj, W 2 , 
T|((X) takes on the values 



• • • 


Wp 


, Wp, respectively. 


Then 


with frequency Wj for j • 1, 2, ••• , n. Thus, the expected value of T|^X is 




W_ - 


t 

p-j+1 





b. If two original sets of data with arbitrary value assignments are equiva- 
lent in the sense that there is an order-preserving function from one onto 
the other, then the transformed values of the two sets of responses will 
be identical. For example, if s data set contained responses of 0, 1, 2 
for a particular question, and the value 1 was reassigned as 3, and the 
value 2 was reassigned as 7, then, after applying the RIDIT transforma- 
tion, the two data sets (one coded 0, 1, 2 and the other coded 0, 3, 7) 
would yield the same RIDiT coding. 


c. If all original responses, X, in a given interval A < X < B are reassigned 

the single response A * B (or any number between A and B), then the trans- 

2 

formed values of responses outside the interval A < X < B are not changed. 

d. The RIDIT transformation is itself order-preserving. 


Other properties of this transformation are given by Brockett and 
Levine (ref. 4). 


Two methods are available for the application of a RIDIT transformation to the 
LIST data set depending on how the distributions, F|((X), are derived. One 
method is to pool a collection of training segments to obtain a transformation 
to be applied universally. The other method is to treat each segment 
independently and derive a separate RIDIT transformation to be applied to that 
segment. Tests were conducted to determine the discriminabil ity after 
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applying each of these methods, and comparisons were made to results obtained 
with the original data. These results are presented In section 4. 


3. APPLICATION OF DISCRIMINANTS 

In reference 2, Levine et al. describe a method for computing a discriminant 
to be used on the transformed data. Briefly, the method Is the following: 

a. Let A denote the matrix of transformed data; I.e., A^j Is the RIDIT value 
of the response to question j for the 1^^ respondent (pixel 1). 

b. Let Zq be the vector with the same dimension as the number of questions 
and with each component equal to the number 1. 

c. Define a sequence of vectors, Zj, recursively by the formula 

a'^’az, 

Z 

IIA^AZ^M 

where ||.|| is the usual euclidean norm. It 1s well known that the sequence 
Zj will converge to the eigenvector corresponding to the largest eigenvalue of 
A^A when the multiplicity of that eigenvalue is 1. Since the RIDIT transfor- 
mation has the property that each column has a mean of zero, A^^A is merely the 
covariance matrix of the transformed data. Thus, the discriminant plane 
determined by this method is orthogonal to the direction of greatest variance 
in the data. 

When no information is available concerning the underlying classes in the data 
and with the knowledge that the variance In the data Is related to class 
membership, this discriminant seems reasonable. In applications involving the 
LIST data, however, ground truth observations were available. Thus, conven- 
tional discriminations were trained on the transformed data, and their per- 
formance was compared to the performance of the above converging routine and 
to the direct use of the major eigenvector. With the data sets available in 
this study, no example of the covariance matrix having a principal eigenvalue 
with multiplicity greater than 1 has been found. 



4, RESULTS 


To determine the applicability of RIDIT's to LIST data, two tests were 
conducted. The first test was to compare test and training discriminant 
accuracies obtainable with the normal data and with data transformed by the 
two RIOIT transformations discussed earlier. The discriminant used In this 
test was the quadratic discriminant (weighted by priors) which Is available In 
the Statistical Analysis System (SAS, ref. 7). The transition ye'"* LIST data 
used In the test are comprised of 24 blind sites from North Dakota, South 
Dakota, and Minnesota. (See table 1.) In each discriminant run, the first 16 
segments were used for training and the remaining 8 for testing. The result* 
Ing test and training accuracies are presented In table 2. 

The second test was conducted to compare the accuracy obtained using the SAS 
discriminant on the two sets of transformed data obtained In the first test 
with the accuracy obtained using Levine's converging routine and the accuracy 
of using the major eigenvector directly. First, the data obtained by applying 
a separate RIOIT transformation to each segment were considered. For each 
segment, a separate covariance matrix was obtained from the transformed data. 
The major eigenvectors were extracted from SAS routines, and Levine's algo- 
rithm was applied to approximate the major eigenvector. The data were then 
projected onto each of these vectors and the resulting one-dimensional data 
sets were submitted to the SAS discriminant procedure to determine an optimal 
decision boundary for each se^nent. The accuracies obtained for each segment 
are presented In table 3, along with the means and standard deviations of the 
accuracies of each method. 

Next, the separately transformed segments were grouped Into training and test 
sets using the first 16 segments from table I as a training set and the 
remaining 8 as a test set. A covariance matrix was obtained from the training 
data. The major eigenvector and Levine's approximation were computed as 
before, and the training and test data were projected onto these vectors. The 
SAS discriminant procedure was applied to the projected training set to obtain 
the appropriate decision boundaries. The decision boundaries were then 
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TABLE 1.- LANDSAT DATA SEGMENTS, COUNTIES, AND ACQUISITIONS 
WHICH COMPRISE THE TRANSITION YEAR LIST DATA 


Segment 

County and State 

Acquisitions 

Training set 

1380 

HvcKwood, Minnesota 

78115, 78169, 78204, 78222 

1394 

Burke, North Dakota 

78120, 78174, 78228, 78264 

1457 

Ward, North Dakota 

78174, 78228, 78246, 78264 

1461 

Pierce, North Dakota 

78137, 78190, 78217, 78236 

1472 

Barnes, North Dakota 

78117, 78135, 78216, 78243 

1473 

Cass, North Dakota 

78116, 78197, 78207, 78251 

1518 

Roseau, Minnesota 

78135, 78188, 78224, 78243 

1566 

Grant, Minnesota 

78133, 78169, 78196, 78232 

1584 

Pembina, North Dakota 

78117, 78198, 78216, 78243 

1602 

Montrail, North Dakota 

78174, 78211, 78228, 78264 

1612 

McHenry, North Dakota 

78137, 78155, 78199, 78236 

1619 

Grandforks, North Dakota 

78135, 78207, 78243, 78252 

1636 

Stutsman, ^rth Dakota 

■ 136, 78154, 78217, 78243 

1650 

Hettinger, North Dakota 

78156, 78209, 78218, 78246 

1658 

Dickey, North Dakota 

78117, 78135, 78207. 78252 

1668 

Perkins, South Dakota 

7815o, 78174, 78228, 78264 

Test set 

1676 

Brule, South Dakota 

78135, 78207, 78224, 78234 

1755 

Jerauld, South Dakota 

78117, 78153, 78197, 78225 

1909 

Kidder, North Dakota 

78136, 78154, 78208, 78217 

1918 

Grant, North Dakota 

78137, 78209, 78236, 78263 

1656 

Mormon, North Dakota 

78137, 78155, 78209, 78263 

1825 

No*Tnan, Minnesota 

78133, 78169, 78196, 78232 

1842 

Yellow Medicine, Minnesota 

78133, 78205, 78223, 78241 

1784 

Minn'haha, South Dakota 

78134, 78169, 78196, 78223 


6 




TABLE 2.- A COMPARISON OF TEST AND TRAINING ACCURACIES OBTAINED 
WITH CONVENTIONAL AND TRANSFORMED DATA 


Probability of correct labeling 


Type of data set used 

Training samples 

Test samples 

Normal LIST transition year data 

72.32 

66.64 

LIST transition year data with the entire 
training set used to develop the RIDIT 
transformation 

77.44 

67.85 

LIST transition year data with the RIDIT 
transformation derived and applied 
Independently on each segment 

75.75 

77.10 









applied to the projected training and test sets to obtain training and test 
accuracies. These accuracies are presented In table 4. Also presented In 
table 4 are the accuracies obtained by repeating this above procedure starting 
with the data for which one RIDIT transfomnatlon was computed from the 16 
training segments and applied to each segment. These results will be 
discussed In more detail In the next section. 

5. CONCLUSIONS AND RECOMMENDATIONS 

The Identical test and training accuracies in table 4 occurred because of poor 
separation of the data along the direction of the discriminant vector. The 
separation was so poor, In fact, that the best discriminant decision boundary 
was placed so that all training samples (and test samples) would be ciassifled 
Into the predominant class, nonsmall grains. 

This phenomena also occurred In the application to Individual segments pre- 
sented In table 3. There was also a great deal of variance noted In the 
Individual eigenvectors generated for each segment, and the small-grains class 
was not projected in a way that would lead to a unified decision rule. 

Table 5 lists the major eigenvectors generated for five segments, and figure 1 
gives two extreme examples of the distribution of the data along the major 
eigenvectors. Because of this poor separation, the use of the major eigen- 
vector as a discriminant vector is not recommended. 

The RIDIT transformation computed Independently for each segment does, 
however, appear to hold promise for use as a data normalization technique. 

Note In table 2 that the test accuracy is significantly Improved using the 
data where a separate RIDIT transformation was applied to each sepent. There 
are examples In the LIST data of segments In which one of the LIST responses 
Is constant across the sepent. When no RIDIT transformation Is performed, 
each of these responses Is then weighted In comparison to the entire data set, 
and some must contribute to erroneous discriminant scores. The same principle 
holds when the response Is compared to the entire set of responses to obtain a 
unified RIDIT transformation. However, If a separate RIDIT transformation Is 
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TABLE 4.- ACCURACY OF USING THE MAJOR EIGENVECTOR FOR DISCRIMINATION 
ON UNIVERSALLY TRANSFORMED DATA SETS 

[Computed from pooled data sets] 



Probability of correct labeling 

Method of obtaining 
RIDIT transformation 

Major eigenvector 

Levine's method 

Training data 

Test data 

Training data 

Test data 

Each segment trans- 
formed independently 

64.46 

83.31 

64.46 

83.31 

One RIDIT transfor- 
mation computed from 
training segments 

64.46 

83.31 

64.46 

83.31 



TABLE 5.- LISTING OF MAJOR EIGENVECTORS 
GENERATED FOR FIVE SEGMENTS 


Major eigenvectors by segment 


1612 

1619 

1636 

1650 

1658 

-0.02211 

-0.0607 

-0.15535 

-0.0497 

0.236357 

-.03654 

-.1967 

-.30945 

-.10001 

.196131 

-.14804 

-.20283 

.071697 

-.0322 

.135674 

-.05088 

-.17429 

.087242 

.025475 

.040506 

-.27146 

.019571 

-.29896 

-.24965 

.263358 

-.27039 

-.24168 

-.3547 

-.24342 

.292772 

-.26766 

-.27172 

.106559 

-.25154 

.191614 

-.25388 

-.24932 

-.08914 

-.31063 

.217026 

.181374 

.218742 

.091025 

.176134 

.097652 

.315556 

.30019 

.341235 

.374018 

-.34074 

.178593 

.230127 

.071158 

.192524 

.125144 

.322089 

.303404 

.351265 

.361622 

-.32461 

. 160846 

-.03046 

.264795 

.100246 

.19588 

.160148 

.208172 

.088044 

.099044 

-.09507 

.151644 

.199183 

.011309 

.080798 

.133114 

.144349 

.163862 

.074912 

-.0635 

.148612 

.073459 

.107764 

-.14005 

-.01276 

.217098 

.092598 

.02344 

.067957 

.147505 

.217098 

.192346 

.16167 

.093618 

.008633 

.173433 

.070835 

.192986 

.111979 

.062737 

-.03686 

.254892 

.174262 

.294942 

.174891 

-.26336 

.265028 

-.03243 

.33591 

.249916 

-.27712 

.267663 

.243377 

.194668 

.235427 

-.17985 

.253922 

.262672 

.249733 

.113267 

.251545 

.312498 

-.21703 
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Figure 1.- Examples of distributions of classes 
(0 = small grains and 1 = nonsmall grains) 
projected onto tiic mSjor eigenvector. 
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computed for that segment, then the transformed value of the channel in ques 
tion Is zero for each pixel in that segment. Thus, that particular channel 
does not contribute to the discriminant score for the pixels in that segment 
but may indeed be applicable to pixels in other segments. The use of the 
RIDIT transformation in conjunction with conventional discrimination tech- 
niques seems to be justified under these circumstances. 
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